Network Structures from Selection Principles 
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We present an analysis of the topologies of a class of networks which are optimal 
in terms of the requirements of having as short a route as possible betw^een any tw^o 
nodes w^hile yet keeping the congestion in the network as low^ as possible. Strikingly, 
we find a variety of distinct topologies and novel phase transitions between them on 
varying the number of links per node. Our results suggest that the emergence of 
the topologies observed in nature may arise both from growth mechanisms and the 
interplay of dynamical mechanisms with a selection process. 
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There have been many exciting recent developments 
m y? 13] in understanding the topologies of many natural 
and artificial networks. The analysis of network topology 
is carried out using classic concepts such as clustering Q , 
the distribution of the number of links from each node 
(called the degree) j2, i3, _5| and its small world charac- 
ter la, LD| • Strikingly, many of the observed topologies 
are quite distinct from those expected for generic ran- 
dom networks 4, 8\ . There has been important progress 
0, y, la, S ll^ in rationalizing the existence of non- 
universal scale-free networks (the degree distribution ex- 
hibits a power law behavior over a finite range with a 
non-universal exponent) by dynamical models entailing 
the growth by node and edge addition (with possible pref- 
erential attachment), rewiring j,2] and edge removal [lOj. 

Our focus here is the proposal and analysis of a class 
of models in which the key selection criterion for network 
topology is optimality. Channel networks formed in river 
basins have been shown to attain, in the steady state 
of their dissipative dynamics epitomized by the general 
landscape evolution equation |lll | , a minimum of total en- 
ergy dissipation |l3| • Strikingly, a variety of robust scal- 
ing features emerge that closely resemble those observed 
for natural landforms |l2l |. and universality classes exist 
depending, for example, on the terrain heterogeneities 
|lj|. Because of the nature of the functional to be mi- 
minized, all trees, i.e. networks with no loops, are lo- 
cal optima and thus prevail over networks which are not 
competitive from an evolutionary viewpoint jlll Il2l |l3J . 
Optimization has been introduced as a possible explana- 
tion of the degree distribution observed in the Internet 
topology 14s| or to investigate the origin of small-world 
networks 15], taking into account the physical distance, 
i.e. Euclidean distance, between the nodes of a spatial 
network. Scale-free networks arising from optimal design 
have been previously studied [l£j. It has been shown 
that the minimization of a linear combination of aver- 
age degree and average distance (the distance between 



two nodes is defined as the minimum number of edges 
traversed to join them) can lead to the emergence of a 
truncated power-law in the degree distribution. 

Our goal is to understand the topology of networks 
which minimize a physically motivated cost function. 
Strikingly, we find a variety of distinct topologies and 
novel phase transitions between them on varying the 
number of links per node. 

Suppose that some type of information has to be com- 
municated between pairs of nodes of the network |a| . It 
is plausible that besides the average distance between 
any two nodes, the type of nodes encountered along the 
path(s) joining them may also matter in the optimiza- 
tion of the dynamics of communication taking place in 
the system. For example, selective pressure may operate 
so as to choose certain nodes because of their high con- 
nectedness - or else to avoid them for the same reason. 
Associated with the type of node, is a local feature that 
depends only on its degree, namely, the number of edges 
rooted in the node. On a global scale, we will distinguish 
among structures that rewire local features at random 
selecting the changes if the new structure provides a se- 
lective advantage. It is well known that in many such 
optimization problems, the key factor that matters is the 
shape of the cost function |l2, 13] . The concavity or con- 
vexity of the cost function can be embodied by a power 
law form with scaling exponent a less than or greater 
than 1 respectively: 
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where i and j are pairs of nodes of the network, and 
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Here P is any path connecting site i to site j of the 
system, p is any node belonging to such a path and kp 



is the degree or connectivity of node p. The weighted 
distance dij{a) is a global quantity associated with the 
pair i,j and is the minimum of the sum of degrees k" 
(a local property), evaluated along the path P from i 
to j, over all the paths connecting i and j. Note that 
in the special case of loopless tree-like structures, such a 
path is unique and dij = X^pgpi^,- ^p • ^^ the limiting 
case a ^ 0, Eq.(|21l becomes the standard definition of 
distance on a network \J\ . The new definition of weighted 
graph distance introduced in Eq. 10) captures the conflict 
between two competitive trends: the avoidance of long 
paths and the desire to skip heavy traffic. 

The networks minimizing the cost eq.(^ are searched 
for among the ensemble containing a fixed number of 
nodes n, as well as the number of links (edges) I. The 
resulting networks are analyzed in terms of the degree 
distribution P(fc), i.e. the fraction of nodes with degree 
k, the average distance between pairs of nodes and the 
average clustering coefficient C = n~^ J2i ^jj where d is 
a measure of how interconnected the neighbors of a given 
node are lZ||: 
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li is the number of links between the neighbours of node 
i and ki{ki — l)/2 is the total number of possible pairs 
that can be formed among them. 

The optimization method used in the numerical simu- 
lations is a Metropolis scheme at zero temperature. The 
goal is to obtain the statistics of all local minima which 
are accessible topologies associated with the chosen dy- 
namics [ll|. 

We have studied several values of a and r — l/n with 
n = 35 — 200. The protocol of the simulation is as follows: 

i) generation of a random initial configuration with 
fixed n and I; 

ii) random rewiring: Specifically, a link connecting the 
sites i and j is randomly chosen and substituted with 
a link from i to a site k, not already connected to i, 
extracted with uniform probability among the sites of 
the system. This ensures that the number of links I as 
well as the size of the system n remains constant during 
the minimization; 

iii) connectedness control: If the graph is not con- 
nected after rewiring, step (ii) is repeated; 

iv) energetic control. The new value of Ha{t -I- 1) is 
calculated. The new configuration is accepted only if it 
is energetically favorable, i.e. only if Ha{t + 1) < Ha{t); 
otherwise the change is rejected and we return to step 
(ii). 

Note that the zero-temperature setting ensures feasi- 
ble optimality of the emerging network structure [l3J, 
a feature that is relevant for dynamical accessibility of 
complex optimal structures. The minimization algo- 
rithm stops after F consecutive failed changes on the 
network; we have chosen F = n{n — 1), so that, on 



average, each pair of vertices is allowed to change its 
state twice. For each case we performed 200 indepen- 
dent simulations, starting with different random ini- 
tial configurations and varying the size n of the sys- 
tem: n = 35,50,70,100,140,200. For each size, the 
different values of the ratio r investigated are: r — 
1.05,1.1,1.2,1.3,2.0,2.3,3.0. 

On varying r, we observe two distinct behaviors. The 
first occurs for values of r ^ 1: the system displays an 
apparent scale-free behavior in P{k) for several values of 
a (see Figure^ for a = 0.7). However, the behavior does 
not seem to be a genuine power law because the sharp 
cut-off does not display the expected dependence on the 
system size n. Unfortunately, the computational cost, 
which grows exponentially with the number of nodes, 
does not permit us to quantify the weak dependence of 
the cut-off on n. As a increases, this apparent scale-free 
region shrinks around the value r — 1 and is vanishingly 
small for a > 1. The second behavior is obtained for 
larger values of the ratio r - the degree distribution ob- 
tained is strongly peaked around the average value of fc, 
< fc > (Figure Q). 




FIG. 1: Degree distribution, averaged over 200 realizations, 
for several system sizes (n = 35, 50, 70, 100, 140) for a = 0.7 
and r — 1.05. The system displays a range of degrees. 
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FIG. 2: Crossover between the two distinct behaviors: the 
heterogeneous regime which exhibits a range of degrees and 
the homogeneous one characterized by a peaked distribution. 
Data are averaged over 200 realizations for a — 0.7, n = 70 
and for several values of r = l/n. 

A sample of network topologies are illustrated in Fig- 
ure (jSJ, for different values of a and r. 

On increasing the value of the ratio r, one moves from 
networks characterized by the presence of some highly 
connected nodes together with many peripheral sites 



(Top Left and Right) to networks in which almost ev- 
ery node has the same degree k —< k > (Bottom Left 
and Right). In addition, a sharp transition is observed 
in terms of the average clustering coefficient C =< Ci >, 
as defined in eq.©. 





FIG. 3: Graph representation of four typical networks with; 
Top Left: a = 0.4, r = 1.05, n = 100; Top Right: a = 
0.7, r = 1.05, n = 140; Bottom Left: a = 0.5, r = 2.0, n = 
50; Bottom Right: a = 2.0, r = 1.05, n = 100. The graphs 
have been produced with the Pajek software. 

For a > 1 (fig. ^ Top) , the system undergoes a clear 
phase transition as the value of the ratio r increases pass- 
ing from a regime characterized by zero clustering to 
one in which the clustering coefficient becomes different 
from zero. The cost function in eq.Q has two compet- 
ing forces: the minimization of the graph diameter and 
the minimization of node degree. When a > I the min- 
imization of node degree dominates and the system at- 
tempts to minimize the degree of each node resulting in a 
peaked distribution around the mean value < k >, with 
a non-trivial topology characterized by zero clustering 
and exhibiting the presence of long loops. (fig.|3lBottom 
Right). When the ratio r reaches the critical value rda), 
one obtains a non-zero clustering coefficient. 

This transition also occurs for a < 1. However, when 
a < I one obtains an additional phase transition at 
r^(a), where the system passes from optimal networks 
exhibiting a non-zero clustering coefficient, to ones with 
no clustering at all. Starting from very small values of 
r, we observe topologies characterized by the presence of 
few interconnected hubs (i.e. sites with very high degree 
0,|l3) finked to many peripheral sites (fig. |3lTop Left). 
Indeed, when a < 1, the tendency expressed by the cost 
function is to decrease the graph diameter, i.e. a measure 
of the mutual distance among pairs of nodes. 

The emergence of this extra phase transition under- 
scores the importance of the concavity (convexity) of the 
cost function. 

The limiting case a — > would correspond to the min- 
imization of the standard graph distance, leading, in the 
region r '^ 1, to a single central hub connected to n—1 pe- 



ripheral nodes, which share the remaining l — n + 1 links. 
This situation leads to non-zero clustering. The mini- 
mization of the graph distance corresponds to a limiting 
case of [13 as well; however, in yjl there is no constraint 
on the number of links I, so that the optimal network 
they find is a clique, in which each node is connected to 
each other. 
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FIG. 4: Mean clustering coefficient for the optimal config- 
uration Copt normalized to the mean clustering coefficient, 
Crand, of the raudom configuration. Top: results for net- 
work size n — 70 and a = 2.0; in the inset the behaviour 
of the ratio Copt/CrandP is shown, where CrandP represents 
the mean clustering of a random graph with the same degree 
distribution P{k) as the optimized network. Bottom: results 
for network size n = 70 and a = 0.35; in the inset (n = 50, 
a = 0.35) both the critical values, rc{ct) and r'^^a), are shown. 



Increasing the ratio r does not favour adding other 
links among the hubs, because their already high degrees 
would only increase further. Hence the system reorga- 
nizes by increasing the number of hubs and automatically 
reducing their degrees, trying to avoid expensive trian- 
gles between hubs. When the transition occurs, at r'^[a), 
the network does not exhibit hubs any more, but tends to 
become quite homogeneous in the sense that almost every 
node has coordination close to the average value < k >. 
Even in this regime the optimal topology is distinctly dif- 
ferent from the random one. In fact, it displays a peaked 
degree distribution around the mean value < k > with- 
out significant clustering (fig. ^Bottom Left). The loops 
formed have the maximum possible length in order to 
reduce the energy function. Adding extra links to the 
network forces the loops to become smaller, still avoid- 
ing clustering up to a second critical value of r, rc{a). 
Beyond this value, 'triangles' appear leading to a tran- 
sition similar to the one encountered for a > 1 (fig. ^ 
Bottom, inset). 



The extent of the clustering phase for r < r'^{a) and 
a < 1 shrinks for increasing values of a; the critical value 
rc{a) decreases as a increases, Va. From Fig. ^I^land 
Fig. ^ one finds that several distinct topologies are ob- 
tained for different values of a and r: a heterogeneous 
regime exhibiting a broad distribution of degrees (r ~ 1, 
a < 1) observable both in the clustering and no cluster- 
ing phase depending on the value of a; a homogeneous 
regime for larger values of r with C 7^ (r > rc{a) Va, 
and a < 1, r < r'^{a) but not in the tree-like limit) or 
C = (a < 1, r^(a) < r < rc{a) and a > 1, r < rc{a)). 

We have also studied the characteristic path length, L, 
defined as the average, over all pairs in the system, of the 
graph distance between pairs of nodes. 
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FIG. 5: Characteristic path length Lopt, 
classical random one Lrand, vs. a. 
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As shown in fig. El in the entire interval of a, the char- 
acteristic path length of the optimal configuration, Lgpt, 
is comparable to or smaller than the random one, Lrand- 
Even though the small network sizes studied here do not 
allow us to reach definitive conclusions, the system seems 
to display a small- world effect \^. 

We have studied the system behaviour in terms of 
mean clustering and average path length in comparison 
to both a classical random graph |j,|8j [Copt I Grand and 
Lopt/ Lrand) and a random graph characterized by the 
same degree distribution P(k) as the optimized network 
(Copt/CrandP and Lopt/Lrandp)- both studics give com- 
parable results (see for example the top inset of fig.01). 

In summary, we have investigated the role of selective 
pressure in determining the topological features observed 
in natural and artificial complex networks. Our work is 
complementary to existing models that either rely on dy- 
namical mechanisms, such as preferential attachment, or 
on topological and geometrical criteria. Optimality leads 
to the emergence of several distinct network structures 
including an apparent scale-free arrangement in the tree- 
like topology limit. Besides the degree distribution, we 
have studied the clustering coefficient and the average 
path length of the selected networks which point to the 
existence of non-trivial phase transitions and to the fea- 
tures of the small-world effect. Our main result is that 
the emergence of the topologies observed in nature may 



not exclusively be the outcome of growth mechanisms 
but may also arise from the interplay of dynamical mech- 
anisms with an evolutionary selection process. 
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